Search CORE

536 research outputs found

MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers

Author: Chen Chen
Li Sijia
Lu Haonan
Publication venue
Publication date: 08/09/2023
Field of study

Diffusion-model-based text-guided image generation has recently made astounding progress, producing fascinating results in open-domain image manipulation tasks. Few models, however, currently have complete zero-shot capabilities for both global and local image editing due to the complexity and diversity of image manipulation tasks. In this work, we propose a method with a mixture-of-expert (MOE) controllers to align the text-guided capacity of diffusion models with different kinds of human instructions, enabling our model to handle various open-domain image manipulation tasks with natural language instructions. First, we use large language models (ChatGPT) and conditional image synthesis models (ControlNet) to generate a large number of global image transfer dataset in addition to the instruction-based local image editing dataset. Then, using an MOE technique and task-specific adaptation training on a large-scale dataset, our conditional diffusion model can edit images globally and locally. Extensive experiments demonstrate that our approach performs surprisingly well on various image manipulation tasks when dealing with open-domain images and arbitrary human instructions. Please refer to our project page: [https://oppo-mente-lab.github.io/moe_controller/]Comment: 5 pages,6 figure

arXiv.org e-Print Archive

中国の中小企業に求められる経営戦略 : 浙江省の民営中小企業を中心として

Author: Sijia Chen
陳思佳
Publication venue
Publication date: 31/03/2014
Field of study

Kwansei Gakuin University Repository